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Abstract 



Sensor networks aim at monitoring their surroundings for event detection and object tracking. But, due to 
failure, or death of sensors, false signal can be transmitted. In this paper, we consider the problems of distributed 
fault detection in wireless sensor network (WSN). In particular, we consider how to take decision regarding fault 
detection in a noisy environment as a result of false detection or false response of event by some sensors, where 
the sensors are placed at the center of regular hexagons and the event can occur at only one hexagon. We propose 
fault detection schemes that explicitly introduce the error probabilities into the optimal event detection process. 
We introduce two types of detection probabilities, one for the center node, where the event occurs and the other 
one for the adjacent nodes. This second type of detection probability is new in sensor network literature. We 
develop schemes under the model selection procedure, multiple model selection procedure and use the concept of 
Bayesian model averaging to identify a set of likely fault sensors and obtain an average predictive error. 

ON ' Keywords: Event Detection, Wireless Sensor Network, Multiple Model Selection, Bayesian Model Averaging. 

r- : 
■ 

^h" ' 1 Introduction 

O : 

Traditional and existing sensor-actuator networks use wired communication, whereas wireless sensor networks (WSN) 
provide radically new communication and networking paradigms and myriad new applications. The wireless sensors 
' have small size, low battery capacity, non-renewable power supply, small processing power, limited buffer capacity 
and low-power radio. They may measure distance, direction, speed, humidity, wind speed, soil makeup, temperature, 
rN ' chemicals, light, and various other parameters. 

Recent advancements in wireless communications and electronics have enabled the development of low-cost WSN. 
A WSN usually consists of a large number of small sensor nodes, which are equipped with one or more sensors, some 
processing circuit, and a wireless transceiver. One of the unique features of a WSN is random deployment in 
inaccessible terrains and cooperative effort that offers unprecedented opportunities for a broad spectrum of civilian 
and military applications, such as industrial automation, military surveillance, national security, and emergency 
health care [21 [23 H] • Sensor Networks are also useful in detecting topological events such as forest fires [TU] . 

Sensor networks aim at monitoring their surroundings for event detection and object tracking [5J[T7]. Because of 
this surveillance goal, coverage is the functional basis of any sensor network. In order to fulfill its designated tasks, a 
sensor network must fully cover the Region of Interest (ROI) without leaving any internal sensing hole [3,5,6,9]. So 
far, a number of movement-assisted sensor placement algorithms have been proposed. An exclusive survey on these 
topics is presented by Li et al. [Tl]. On the other hand sensor could die or fail at runtime for various reasons such 
as power depletion, hardware defects, etc. So, even after the ROI is fully covered by the sensors, wrong information 
can be communicated by some sensors, or sensors may fail to detect the event due to noise or obstructions. Chen 
et al. [5] have proposed a distributed localized fault detection algorithm for WSN, where each sensor identifies its 
own status to be either good or faulty and the claim is then supported or reverted by its neighbors. The proposed 



'Corresponding author. Email: mrinal.nandil@gmail.com 



1 



algorithm is analyzed using a probabilistic approach. Sharma et al. [5T] have characterized the different types of 
fault and proposed a different algorithm for fault detection considering different types of fault. Some of the methods 
are statistical, like, using histogram, etc. Both the work can only detect the faulty sensors, but not the event. 

One of the important sensor network applications is monitoring inaccessible environments. Sensor networks are 
used to determine event regions and boundaries in the environment with a distinguishable characteristic |13[ [TJ I19| . 
The basic idea of distributed detection [55] is to have each of the independent sensors make a local decision (typically, 
a binary one, i.e., an event occurs or not) and then combine these decisions at a fusion sensor (the sensor which 
collects the local information and takes the decision), or at a base station to generate a global decision. 

1.1 Our Motivation 

In this paper, we are interested in one particular query: determining event in the environment (i.e., ROI) with 
a distinguishable characteristic. We assume the ROI to be partitioned into suitable number of congruent regular 
hexagonal cells (i.e., we can think ROI as a regular hexagonal grid). This physical structure of ROI is not a 
requirement for the theoretical analysis, we can do the similar analysis with other structure also. Suppose that 
sensors are placed a priori at the center (which are known as nodes) of every hexagon of the grid. We assume that 
the sensors are connected to its adjacent sensor nodes in the sense that a hexagon will be strongly covered by its 
center node and weakly covered by the adjacent nodes. If event occurs in the hexagon where a particular sensor lies, 
then that particular sensor can detect the event with a greater probability whereas, if event occurs in any adjacent 
hexagon, then the particular sensor can detect the event with a lesser probability. Hence, only one node (center 
node of the event hexagon) can detect the event hexagon with greater probability, say p\, and adjacent nodes (six 
for interior nodes and less for boundary nodes) can detect the event hexagon with lesser probability, say P2, with 
Pi > P2- We assume that no other sensor can detect the event hexagon. In this paper, unlike the previous works, we 
assume that if the event occurs then it occurs at only one hexagon of the grid which will be known as event hexagon 
and there is no fusion sensor. All sensors can communicate with the base station and the base station takes the 
decision about the query. As an example, consider a network of devices that are capable of sensing mines or bombs, 
if we assume that a few mines or bombs may be placed on a particular area of ROI. Information from these devices 
can be sent to a nearby police station, or a central facility. Then, an important query in this situation could be 
whether a particular hexagon is the event hexagon or not (i.e., mines or bombs are placed or not). 

One fundamental challenge in the event detection problem for a sensor network is the detection accuracy which 
is disturbed by the noise associated with the detection and the reliability of sensor nodes. A sensor may fail to 
detect the event due to natural obstruction or any other causes. After detecting the event, a sensor can send false 
message to the base station due to some technical reasons. The sensors are usually low-end inexpensive devices and 
sometimes exhibit unreliable behavior. For example, a faulty sensor node may issue an alarm even though it has 
not received any signal for event, or it cannot detect any event, and vice versa. Moreover, a sensor may be dead, in 
which case, the sensor cannot send any alarm. 

1.2 Our Contribution 

In our theoretical analysis, the sensor fault probabilities are introduced into the optimal event detection process. We 
apply model selection approach, multiple model selection approach and Bayesian model averaging methods |12l 118] 
to find a solution of the problem. We develop the schemes using the model selection technique. We calculate different 
error probabilities and find some theoretical results. 

In all previous works, the authors assume only one detection probability. We introduce two detection probabilities, 
Pi and pi- one for the center node and other for the adjacent nodes. Even if the center node may fail to detect 
the event, the adjacent nodes may detect the event, and vice versa. We consider these probabilities and show 
that, in various situations, the adjacent nodes play key role to detect the event. One can introduce more detection 
probabilities and analyze the situation in similar manner. 

The parameters p\ and P2, the detection probabilities of a sensor, and error probabilities (see Section @| cannot 
be estimated from the real life situations, but need to be estimated beforehand by some experimentation. The prior 
probabilities of various events also cannot be estimated, but may be known in some cases. Finally, we calculate the 
error probabilities numerically for some values of the parameters of our model and make some concluding remarks 
analyzing the results. 
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Figure 1: Nodes placed in centers when ROI partitioned in to regular hexagons 




2 Previous Work 

Lou et al. [T5| consider two important problems for distributed fault detection in WSN: 1) how to address both the 
noise-related measurement error and sensor fault simultaneously in fault detection and 2) how to choose a proper 
neighborhood size n for a sensor node in fault correction such that the energy could be conserved. They propose a 
fault detection scheme that explicitly introduces the sensor fault probability into the optimal event detection process. 
They show that the optimal detection error decreases exponentially with the increase of the neighborhood size. 

Krishnamachari and Iyengar [13] propose a distributed solution for canonical task in WSN (i.e., the binary 
detection of interesting environmental events) . They explicitly take into account the possibility of sensor measurement 
faults and develop a distributed Bayesian algorithm for detecting and correcting such faults. 

Nandi et al. [TB] consider the problem of distributed fault detection in wireless sensor network (WSN) , where the 
sensors are placed at the center of a particular square (or hexagon) of the grid covering the ROI. They proposed 
fault detection schemes that explicitly introduce the error probabilities into the optimal event detection process. 
They developed the schemes under the consideration of Neyman-Pearson hypothesis test and Bayes test. They also 
calculate type I and type II errors for different values of the parameters. 

In almost all the previous works, except |16j . authors assume that event occurs over a region and there are fusion 
sensors that collect the information locally and take a decision. Since they do not introduce the concept of base 
station there is no concept of response probability. Also, they assume informations are spatially correlated. Unlike 
the previous work, in this paper, we assume that if event occurs then it occurs at only one cell of the ROI and there 
is no fusion sensor. All the sensors send information to the base station. We introduce the probability model in two 
different stages; firstly, when a sensor detects the event and, secondly, when a sensor sends the message to the base 
station. In the previous works, only one type of detection probability has been introduced to simulate the different 
error probabilities for some specific values of the parameters. In this paper, we introduce two different detection 
probabilities and obtain analytically the exact test and estimate the error probabilities by simulation. In almost all 
the previous works, authors assume the ROI to be a square grid. The hexagonal grid is better in the sense that 
minimum number of sensors are required to cover the entire ROI |23| . 

3 Statement of the problem and Assumptions 

In this section, we describe the problem in more specific terms and state the assumptions that we make. 
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Sensors are deployed, or manually placed, over ROI to perform event detection (i.e., to detect whether an event of 
interest has happened or not) in ROI. If sensors are deployed from air then, using actuator-assisted sensor placement 
or by movement-assisted sensor placement, sensors are so placed that sensor network covers the entire ROI. This 
ROI is partitioned into suitable number of regular hexagons (i.e., we can think of the ROI as a regular hexagonal 
grid) , as shown in Figure Q] Sensors are placed a priori at every center (which are known as nodes) of the regular 
hexagons. Sensors have two detection probabilities. The sensor network covers the entire ROI and there is only one 
event hexagon, as discussed before. 

Each sensor node determines its location through beacon positioning mechanisms [1] or by exploiting the Global 
Positioning System (GPS). Through a broadcast or acknowledge protocol, each sensor node is also able to locate the 
neighbors within its communication radius. Sensors are also able to communicate with the base station. Base station 
will take the decision. In this paper, we assume that, event occurs at one particular hexagon of the grid which will be 
known as event hexagon or event does not occur (in that case we say ROI is normal) . All sensors can communicate 
with the base station and base station takes the decision by combining the information received from all the sensors. 

There are two phases in the whole process. The first one is detection phase, when the sensor at the center of a 
regular hexagon tries to detect the event. The sensor at the center of the event hexagon can detect the event hexagon 
with greater probability p\ and the sensors at the adjacent nodes (see Figure [T]) can detect the event hexagon with 
lesser probability p2 ■ We also assume that there is a prior probability that a particular hexagon is an event hexagon. 
The next phase is response phase, in which sensors send message to the base station. Even if the event hexagon is 
detected by a sensor, it may not respond (i.e., send message to the base station that no event occurred in that cell 
and the neighboring cells due to some technical fault) with some probability; then we say that the sensor is a faulty 
sensor. Conversely, if event hexagon is not detected, or there is no event hexagon at all (i.e., ROI is normal), then 
also a faulty sensor can send the wrong information to the base station with some probability. A sensor is said to be 
a dead sensor if the sensor does not work. A dead sensor sends no response in either cases. 

Each sensor sends information to the base station. As the sensors may send wrong information, the base station 
takes the important role in identifying the event hexagon. Base station will collect all the information and take a 
decision about the event hexagon according to a rule which we have to find out. Our job is to find a rule for the 
base station such that base station works most efficiently. 

3.1 Notations and Assumption 

Our problem is to develop a strategy for the base station to take decision about event hexagon (i.e., which hexagon 
of the ROI is the event hexagon, if at all). Let R be the set of all nodes. For N E R, define B(N), as the set of 
adjacent node(s) of N and k(N) be the number of adjacent node(s) of N. Hence, < k(N) < 6. Call a node N 
interior if k(N) = 6. Let Sn be the sensor which is placed at the node N and Hn be the hexagon where the node N 
is placed (i.e., N is the center of Hn)- For N G R, let Xn denote the true status of the node N . That is, Xn = 1 
if event occurs at Hn, and otherwise. Also define Yn = if Sn detects no event, and 1 if Sn detects the event in 
Hm or H n' ? for N' £ B(N). Finally define Zn = if Sn does not respond, i.e., the sensor informs the base station 
that event does not occur at Hn or Hn' for N' £ B(N), and Zn = 1 if Sn responds, i.e., the sensor Sn informs the 
base station that the event has occurred in Hn or Hn>, for N' £ B(N). 

Now we make one natural assumption that, once detection phase is completed, response of a sensor depends only 
on what it detects but not on whether the event has actually occurred or not, i.e., P(Zn = fc|Yjv, Xjv) = P{Zn = 
fc|Y/v), for k = 0, 1. We also assume that the sensors work independently and identically. 

Since we assume that there is at most one event hexagon, YIner -^n = 1 or 0. 
The possible true scenarios are, therefore, represented by the following \R\ + 1 different models: 

M : (X N = for all N € R), 

and, for each N £ R, 

M N ■ (X N = 1 and X N > = for all N' G R\ N). 
Let Pr(A4o) = P(ROI is normal) = p norm 
and, for all N £ R, Pt(M.n) — Pr(event occurs at the hexagon Hn) — Pn- 

In particular, we may assume Pat's to be same for all N. We denote any probability under the model A4q as Pmo(') 
and under the model Mn as Pm n {')- 

We also make the followings assumptions: 

• For all N £ R, P Mo (Y N - 1) - and P Mn (Y n = 1) = pi. 
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• For all N' E B{N),P Mn {Y n , = 1) = p 2 , and 

for all N' E R\ [B(N) U {N}],P Mtf (Y N , = 1) = 0. 

• For all N E R,P{Z N = l\Y N = 1) = p c and P(Z N = 1\Y N = 0) = p w . 

• Zm and Yn< are independent for N ^ N' . 

• The responses from different nodes are independent under a particular model, i.e., Z/v's are independent under 
Mn' for a fixed N' E R. 

4 Theoretical Analysis of fault Detection 

In this section we discuss some theoretical results. In real situations, \R\ may be very large. Given the network of 
the sensor nodes and some prior knowledge about the nature of event, one may have fairly good idea about the set of 
feasible regions for the event. Formally, instead of all possible models, one may be able to restrict to a set containing 
all the feasible models. For example, if the event is known to take place in a particular region, we can restrict our 
models accordingly. 

4.1 Model Selection Approach 

For all N E R, P M „(Z N = 1) 

= Pm (Zn = l\Y N = 0)P Mo (Y N = 0) + P Mo {Zn = 1\Y N = l)P Mo (Y N = 1) 
= P(Z N = 1\Y N = 0)P Mo (Y N = 0) + P(Z N = 1\Y N = 1)P M „(Y N = l)=p w - 

Hence, under the model Mo 1 Zn follows Ber(p„,), for all N € R, and the likelihood of the data {Zn = zn, for all N 6 
R }, under the model Mo, is 

Lo = Pm„(Z n = z N , for all N E R) 

= n pi n i 1 - p™) {i ~ zn) = (p w ) i:NeRZN x a -p w )^ r {i ' zn) . 

ngr 

So In L = S^vefl z N \ap w + T, NeR (1 - z N ) ln(l -p w ). 

For any N E i?, we have Pm n (^n = 1) 
= Pm n {Zn = l\Y N = 0)P Mn {Y n = 0) + P Mn {Zn - l\Y N = 1)P Mn {Y n = 1) 
= P(Z N = 1\Y N = 0)P Mn (Y n = 0) + P(Z N = 1\Y N = 1)P Mn (Yn = 1) 
= PwO- -Pi) +PcPi =Pi(Pc -Pw) +Pw = Pi, say. 

Hence, for all N E R, under Mn, Z n follows Ber(Pi). Similarly, for all N' E B(N), under Mn, Z n > follows Ber(P 2 ), 
where P 2 = p 2 (Pc ~Pw) +Pw and, under M N , Z N > follows Ber(p w ) for all N' E R\[B(N)U{N}}. Note that P x > P 2 
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since p\ > p2- Hence the likelihood for the model A4n, given Zn> = z^i,N' E R, is 
Ln = Pm n ( z n' = zn', for all N' £ R) = 

p 1 ^(i-p 1 )(i-^)n JV , eflW P 2 ^'(i-p 2 )( 1 -^') 

X^N'eR\[B(N)u{N}] P Z W N '(1 -Pw) (1 ~ ZN,) 

= P 1 2JV (1 - P 1 )( 1 " z «) p 2 E «'e-B(jv) z N' q _ p 2 ^ N , eB{N) (l-z N ,) 

Xpw N ' eRKlB<N)UlN1]ZN ' (I - p w )^N'eR\[B(N)u{N}]( 1 - z N') _ 

Let T N = Z N'> so that ^ {1 - Z N ,) = k(N) - T N 

N'eB(N) N'eB(N) 

with the corresponding observed values denoted by 
= zn> and (1 — Zn') = k(N) — t^. 

N'eB(N) N'eB(N) 

Therefore, lnL^r = 

z N InPi + (1 - z N ) ln(l - Pi) + t N ln P 2 + (fc(iV) - t w ) ln(l - P 2 ) 

+ 51 zjvlnp u , + ^ (1 - ZN)ln(p w (l -p w )) 

N'eR\[B(N)u{N}] N'eR\[B(N)V{N}] 

p y p p i 

= lnio + z N \n — + (1 - Ziv)ln- +<Arln — + (k(N) - t N )ln- 



Pw 1 Pw Pw 1 Pit; 

Pi(i- Pu) ) P 2 (i-p w ) , , l-fi , ,., Ani i-p 2 

= In L + z N ln — — + t N ln — — + ln h k(N) ln 

p w {l-Pi) p w (l-P 2 ) l-Pw l—Pw 

= a + 6(czjv + £jv — dk(N)), say, 

where, a = ln Lo + hi , b = ln — ^ ^ttk > 0, 

ln|H^4 In 1 " 



' ' 1 and d = — ) Bl , are independent of N. 



m Pl „(l-P 2 ) m p„(l-P 2 ) 



In model selection approach, the model resulting in the maximum value of the likelihood is selected. Note that, since 
there is no parameter being estimated, this is equivalent to the well-known Akaike Information Criterion(AIC) 
Therefore, the base station will accept the model Mo if 

Y p 

= ln + b(cz N + t N - dk(N)) < 0, for all N E R. 

l — Pw 

Otherwise, as b is positive, accept the model Mn for which (azjv + — dk(N)) is maximum among all N E R- If 
values of (cz^ + — dk(N)) are equal for more than one N, then we can select one of the corresponding models 
with equal probability. If we want to maximize the likelihood for the models Mn corresponding to the interior nodes 
only, so that k(N) is fixed, then we need to maximize (czn + tjsr) among all N E R. 



4.2 Multiple Model Selection 

Instead of selecting one particular model, one may want to select more than one models with approximately similar 
log likelihood values to the maximum one. We can consider the set of models 

{Mk : ^-j- > C}, 

max N( z R L N 

where < C < 1 is a suitable constant close to 1. This C is usually chosen according to the resource available. This 
is similar to the idea of Occam's window in the context of Bayesian model selection |12| . This may be interpreted as 
the interval estimation for the true model. 
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Note that Lm is an increasing function of czm + — dk(N), as b is positive. We consider only the following set 
of models 

{M K ■ Qk > C* ■ max NeR Q N }, 

where Qn — czm + £jv — dk(N), for all N € R, with < C* < 1. In particular, if we consider the interior nodes only, 
then we consider the set of models given by 

{Mr ■ cz K + t K > C* ■ maxNeR{ cz N + *iv}}- 

We can select multiple models using some other criteria. One such may be to select all the models (one or more) 
for which the maximum value of the likelihood is attained. Let 7V ma x be the set of nodes corresponding to all 
these models, including 'N = 0' corresponding to Aio if it has the maximum value of the likelihood. Then this 
method select all the models A4n with N E A/" m ax- By another criterion, one may select the models A4n', f° r 
N' E A/" max U [^NeAT mB .^B(N)]\ that is, N' be a node in A/" max or any of the neighboring nodes of a node in Af max . 
Note that B(N) for N = is the empty set. One can combine these two types of criteria and come up with many 
others. 



4.3 Bayesian Model Averaging 

Bayesian model averaging is an effective method to solve a decision problem when there are many alternative 
hypotheses or models, which are complicated (T2]. Suppose M.\, M.2, ■ . ■ , M-h are the models considered and D 
denotes the given data. The posterior probability for model Aik is given by 

Pr(M k \D)- *MM>mM k ) 



£Pr(£>|M)Pr(M)' 

where Pr(D\A4k) denotes the probability of observing data D under the model Aik (which is essentially the likelihood 
ife under Mk) and Pr(TWfc) is the prior probability that Aik is the true model (assuming that one of the models is 
true) . 

In this work, the data D is {Z^ = zjy : N E R} and the models are Mq, A4n, N S R as defined in Section 3.2. 
Hence, The posterior probability for model AiN is 

Pr(M N \Z N =z N ,N eR)= / ' xLx 



and that for Ain is 



Eie_R Pl-Ll + PnormLo 

PnormLo 
S/eflP'^i + PnormLo 



We select the model Aio ifp norm Lo is greater than p^L^, for all N € R; otherwise, select Ai n for which pnLn is 
maximum among all N E R. Hence, if pat's are all equal, then Bayesian approach is same as the likelihood approach. 



5 Some Important Considerations and Error Probabilities 

In this section, we consider some important issues related to the problem of fault detection and the proposed 
methodology including calculation of errors (e.g., false detection, etc.) and detection probabilities. 

The following probabilities give some idea about the role of neighboring nodes, along with the center node, in 
detection, or false detection, of event. For example, Pm (Ln = 0, = 1) gives the probability of a false detection 
by the N th node, and not by the neighboring nodes, while Pm n {Tn — 6,-Zjv = 0) gives the probability of a false 
negative by the N th node, with all the neighboring nodes detecting the event. Since, given a particular model, Tjv 
and Zm are independent, calculation of such probabilities is simple as given in the following. For any N E R and 
i = 0,1,..., k(N), 

1. Pm {Tn =i 1 Z N = 0) = ( k W)pi,(l -p w ) k ^~ i+i 

2. P Mo (T N =i,Z N = l)= ( fe (fK +1 (l -p^W-i 
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3. P Mn (Tn = i,Z N = 0) = ( fe (f >)f*(l - P 2 )*=W-i (1 _ Pl) 

4. P Mn {T n = i, Z N = l) = ( fe W)P|(l - P 2 ) fe W-P 1 . 



Note that, for N e R,Pm (Ln > L ) = Pm d (lnijv > lnL ) 



P 



-Mo 



Zjv In 



Pm Z N \n 



p w {l-Pi) 

Pl(l-Pn 



„ , fb(l-p w ) , , 1-Pi 



1 - P„ 



1 - Po 

fc(AT) In > 



T N ln^^l>k(N)\n 1 - Pu 



1 ~Pv 
1 -Pv 



p^(l-p 2 ) 



1-P2 



In 



1-Pi 



which can be numerically obtained using the joint distribution of Tjv and Zjy under the model A4q. The maximum 
of these probabilities over all N gives a lower bound for the probability that a node is considered to be an event node 
when the ROI is normal. On the other hand, the sum over all N gives an upper bound for the same. Similarly, for 

N £ R, P Mn {L n < L ) = 

Pmn z n In — r 5^ + T N In — — < k(N) In — + In ■ 



p w {l-Pi) p™(i-p 2 ) v ' 1-P2 1-P1 

which can be again numerically obtained using the joint distribution of TV and Zn under the model A4n- This 
probability gives some idea about the error that, when N th node is the event node and it is not detected. 

As noted in Section BTTl we select the model A4n for which Qn is the maximum, for N £ R. The random variable 
Qn is, therefore, of some interest, the distribution of which under different models is useful in calculating many error 
probabilities. We first find the distribution of Qn under the model Mn- Note that Qn takes values ci + j — dk(N), 
corresponding to Zn = i and Tn — j, for i = 0,1, and j = 0,1,2, ... , k(N). Assume that, for convenience, the values 
of Qn for different i and j are all distinct. Therefore, for i = 0, 1 and j — 0,1, ... , k(N), 

Pm n (Qn =ci+j- dk(N)) = (PO l (l - P 1 )( 1 - i )(P 2 )^(l - p 2 )(MN)-j) 

and, P Mo (Q N = ci+j-dk(N))= (p w ) i+j (1 



For N' £ B(N), or N' e R\[B(N)U{N}], one can find Pm n ,(Qn = ci+j-dk(N)) in similar manner, although the 
calculation is very tedious as there are many sub-cases. Ideally, one is interested in probability of errors occurring 
at the level of base station. For example, the two important errors are: (1) not selecting A4q when A4q is true 
(false positive), and (2) selecting Ado when A4n is true for some N £ R (false negative). Theoretical calculation of 
these error probabilities is complicated. We, therefore, use simulation technique to estimate these and similar error 
probabilities. 



6 Simulation Study 

We consider a 32 x 32 hexagonal grid and we run the programme 10000 times. The simulation is performed using 
the C-code, and required random numbers are generated using the standard C-library. 

In our simulation study, we consider different criteria, as discussed in Sections 14. Il and l4. 21 for estimating the error 
probabilities, or equivalently, the success rate. First consider the probability of selecting A^oi when it is true. Let 
Si denote the proportion of correct detection of normal situation, when model A4q is true, using the model selection 
method of Section 4.1. That is, Si gives an estimate of Pm o (0 ^ N max and is selected by randomization). Then 
1 — Si gives an estimate of the false positive rate. 

When AAn is true for some TV S R, let S2 denote the proportion of correct decision for the event node using the 
model selection method of Section |4"7T1 so that it estimates P_\4 N (N G A/" ma x and is selected by randomization). Note 
that, for each simulation run, the event hexagon is chosen randomly so that S2 gives an average value over all N. In 
this context, this probability is same for all the interior nodes. Then, 1 — S2 gives an estimate of the corresponding 
error probability of not selecting A4n, when it is true. 
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Note that, in this problem of fault detection with a single event node, the likelihood value, for a given observed 
data configuration, may be equal for more than one models. Therefore, quite often, the maximum value of the 
likelihood may be attained by more than one model. The model selection method of Section B~T1 which selects one of 
these models randomly in such cases, may often not select the correct model. Therefore, the method of Section B~2l 
which selects more than one models having similar likelihood value, may be preferred and will have better chance of 
selecting the correct model. We now consider some of those methods in the following. 

Let us first consider the method in which all the models corresponding to the maximum value of the likelihood 
are selected. Let S3 denote the proportion of correct selection of the model A4n, when it is true, by this method. 
Then S3 estimates the probability Pm n (N £ jV max ), which is always more than or equal to the quantity estimated 
by S2, as remarked before. We also consider the method in which all the models having maximum likelihood along 
with their neighborhood models are selected. A model A4n' is a neighborhood model of the model A4n if N' is a 
neighboring node of N. If S4 denotes the proportion of correct selection of the model A4n, when it is true, by this 
method, then S4 estimates Pm n {N £ Af max U {l)N> e fj niax B(N')}). Clearly, S4 > S3 > S2. Similarly, if S5 denotes 
the proportion of correct selection of the model A4n, when it is true, by selecting all those models with likelihood 
value being more than 90% of the maximum likelihood (that is, the method of Section B~2l with C = 0.9) then S5 
estimates the probability Pm n {Ln > 0.9L max ) with L max denoting the maximum value of the likelihood. 

Table 1: Simulation of estimated probabilities for some values of the parameters 



other parameters 


Simulation of different probabilities with p c — 0.9 


pi 


P2 


Pu, 


Si 


S 2 


S 3 


N 3 


S 4 


N 4 


S 5 


N 5 


0.9 


0.0 


0.01 


0.00 


0.08 


0.81 


18.16 


0.81 


59.85 


0.82 


18.17 


0.9 


0.3 


0.01 


0.00 


0.47 


0.69 


5.44 


0.70 


14.81 


0.73 


5.70 


0.9 


0.4 


0.01 


0.00 


0.60 


0.79 


5.11 


0.78 


13.51 


0.80 


5.12 


0.9 


0.5 


0.01 


0.00 


0.70 


0.85 


4.64 


0.85 


11.50 


0.86 


4.88 


0.9 


0.6 


0.01 


0.00 


0.79 


0.90 


3.82 


0.91 


08.18 


0.92 


3.90 


0.9 


0.0 


0.001 


0.35 


0.50 


0.81 


3.17 


0.81 


7.17 


0.81 


3.18 


0.9 


0.3 


0.001 


0.36 


0.59 


0.82 


3.03 


0.83 


6.34 


0.86 


3.06 


0.9 


0.4 


0.001 


0.35 


0.67 


0.87 


2.89 


0.87 


6.18 


0.89 


3.03 


0.9 


0.5 


0.001 


0.36 


0.75 


0.90 


2.89 


0.89 


5.85 


0.93 


2.96 


0.9 


0.6 


0.001 


0.36 


0.83 


0.94 


2.74 


0.93 


5.33 


0.96 


2.79 


0.99 


0.0 


0.01 


0.00 


0.08 


0.89 


17.43 


0.90 


56.20 


0.89 


17.70 


0.99 


0.3 


0.01 


0.00 


0.51 


0.73 


5.18 


0.73 


12.98 


0.79 


5.77 


0.99 


0.4 


0.01 


0.00 


0.62 


0.81 


5.09 


0.82 


13.01 


0.84 


5.21 


0.99 


0.5 


0.01 


0.00 


0.73 


0.88 


4.97 


0.88 


12.69 


0.89 


5.04 


0.99 


0.6 


0.01 


0.00 


0.81 


0.92 


3.66 


0.92 


7.35 


0.93 


3.57 


0.99 


0.0 


0.001 


0.35 


0.57 


0.89 


3.19 


0.89 


6.69 


0.89 


3.20 


0.99 


0.3 


0.001 


0.35 


0.62 


0.88 


2.99 


0.87 


6.00 


0.90 


3.02 


0.99 


0.4 


0.001 


0.36 


0.70 


0.90 


2.91 


0.91 


5.83 


0.93 


2.97 


0.99 


0.5 


0.001 


0.36 


0.79 


0.93 


2.82 


0.94 


5.67 


0.95 


2.83 


0.99 


0.6 


0.001 


0.36 


0.84 


0.95 


2.75 


0.95 


5.31 


0.97 


2.68 


other parameters 


Simulation of different probabilities with p c — 0.99 


Pi 


P2 


Pw 


Si 


S 2 


s 3 


N 3 


S 4 


N 4 


s 5 




0.9 


0.0 


0.01 


0.00 


0.08 


0.90 


18.2 


0.89 


55.62 


0.90 


17.58 


0.9 


0.3 


0.01 


0.00 


0.54 


0.76 


5.14 


0.76 


13.08 


0.79 


5.64 


0.9 


0.4 


0.01 


0.00 


0.67 


0.85 


5.05 


0.85 


12.92 


0.87 


5.12 


0.9 


0.5 


0.01 


0.00 


0.77 


0.91 


4.86 


0.90 


12.10 


0.91 


5.02 


0.9 


0.6 


0.01 


0.00 


0.86 


0.94 


3.57 


0.93 


7.24 


0.95 


3.57 


0.9 


0.0 


0.001 


0.36 


0.57 


0.90 


3.18 


0.89 


6.61 


0.89 


3.19 


0.9 


0.3 


0.001 


0.36 


0.65 


0.88 


2.98 


0.89 


6.34 


0.92 


3.02 


0.9 


0.4 


0.001 


0.36 


0.73 


0.92 


2.87 


0.92 


5.91 


0.94 


2.91 


0.9 


0.5 


0.001 


0.35 


0.81 


0.94 


2.81 


0.94 


5.51 


0.96 


2.82 


0.9 


0.6 


0.001 


0.37 


0.88 


0.96 


2.72 


0.96 


5.24 


0.97 


2.90 


0.99 


0.0 


0.01 


0.00 


0.09 


0.98 


16.9 


0.98 


51.40 


0.98 


17.6 


0.99 


0.3 


0.01 


0.00 


0.58 


0.83 


5.66 


0.83 


14.73 


0.87 


5.69 


0.99 


0.4 


0.01 


0.00 


0.69 


0.90 


5.43 


0.91 


14.38 


0.92 


5.69 


0.99 


0.5 


0.01 


0.00 


0.80 


0.94 


4.61 


0.94 


11.19 


0.95 


4.73 


0.99 


0.6 


0.01 


0.00 


0.87 


0.96 


3.26 


0.97 


6.26 


0.96 


3.35 


0.99 


0.0 


0.001 


0.35 


0.62 


0.98 


3.20 


0.98 


6.32 


0.98 


3.28 


0.99 


0.3 


0.001 


0.36 


0.69 


0.94 


2.90 


0.94 


5.88 


0.97 


3.00 


0.99 


0.4 


0.001 


0.36 


0.76 


0.95 


2.89 


0.96 


5.59 


0.98 


2.85 


0.99 


0.5 


0.001 


0.36 


0.83 


0.97 


2.70 


0.97 


5.48 


0.98 


2.80 


0.99 


0.6 


0.001 


0.36 


0.89 


0.98 


2.68 


0.98 


5.19 


0.99 


2.69 



Suppose Ni denotes the average number of selected nodes to be searched corresponding to Si, i = 1,2, ... ,5. 
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Clearly, N% = 1 — S\ because we need no search when A^o is selected. When event occurs and we consider only one 
TV from Af max , we need at most one search (since no search is needed if A4q is selected) and we have N2 < 1. In 
our simulation, we find N2 = 1 in all the cases; that means, in simulation, A4q has not been selected when event 
occurred. Note that A3 > 1 since we consider all A's in A/max for searching. Again, as before, N4 > -/V3 > 1 > N2. 
Also, by definition, N5 > 1. Table [T] presents the different Si's and iVj's based on simulation for different values of 
Pi,P2,Pc and p w with p\ and p c taking values 0.9 and 0.99, p w taking values 0.01 and 0.001 and P2 taking values 0.0, 
0.3, 0.4, 0.5 and 0.6. The choice of p\ and p c reflects the corresponding high probability, whereas that of p w reflects 
small probability, which is desirable in a good sensor. Since the primary interest is to study the effect of detection 
by neighboring nodes, we consider P2 as (which means there is no effect of neighboring nodes) and some positive 
values less than pi . 

Note that the probability of correct detection under A4o depends only on p w . This is also evident in Table [1] 
Intuitively, if p w is high then the proportion S\ of correct detection in normal situation is low. In Table [TJ we see 
that Si is for p w = 0.01, varies from 0.35 to 0.37 for p w ~ 0.001 and varies from 0.90 to 0.91 for p w = 0.0001 (not 
shown in Table [T]). If we consider smaller value of p w then the success probability S\ will be higher. Hence p w must 
be low as the number of hexagons is high to get better results in normal situation. 

We see that the estimated false negative rate, that is an estimate of Pm n {Mo is selected), is often in our 
simulation (not shown in Table [T]). This is because, if the event occurs at A, then detection of the event by at least 
one of the nodes belonging to {N} U B(N) is highly probable. Furthermore, since the grid size is large, one of the 
node belonging to R \ ({N}U B(N)) may response wrongly, though it cannot detect the event. So, under A4n, there 
is a small probability to select ROI as normal. If we take p w and the detection probabilities p\ and P2 to be very 
small, then we may get some positive false negative rate but this is not a desired condition for a good sensor. 

From simulation, we see that, as P2 increases (for positive P2), Si values increase whereas Ni decrease. As P2 
increases, it helps to differentiate between the likelihood values resulting in lower cardinality of the set A/max and lower 
values of Aj's. However, since the neighboring nodes help to detect the event, the success probability increases. From 
simulation, we find that, as p\ increases, success probabilities also increase, but the effect of P2 is more prominent 
than that of p\. On the other hand, success probabilities also change with p w and p c . Since P2 — means P2 = p w , 
so there is little variability in the likelihood values leading to larger size of A/'max- 

When p w — 0.01, effect of P2 on S^S^S^ and A3, A4, A5 seems to be significant, whereas the same cannot be 
said for p w = 0.001. There is sudden change in SVs and Aj's, when we shift from P2 — to P2 = 0.3, for p w = 0.01, 
but not p w = 0.001. So, when p w is small, the effect of the neighborhood seems to be less. 

The values of S3 and S4 are very similar for different values of the parameters; but large increment in A4 than 
A3 suggests that the idea of neighboring search is not effective. But S3 is much higher than S2', so the method of 
searching all the nodes in N max is a better idea than that of searching a random node from A/" ma x- 

We estimate the success probability Pmj^(Ln > C.L max ) by simulation for different values of the threshold C 
ranging from 0.5 to 0.9 (see Table [2|). Note that S5 corresponds to the threshold value C — 0.9. We consider 
pi = 0.99, p w = 0.001, p c = 0.9 and four values of P2 = 0.3,0.4,0.5,0.6. From Table [21 we see that the success 
probability increases as the threshold value C decreases and P2 increases. Similarly, the number of search decreases 
with both C and p2- 

Table 2: Simulation of estimated success probabilities and number of searches for different threshold values (C) and 
some values of the parameters with p c ~ p\ = 0.9 



other parameters 


C = 0.6 


C - 0.7 


C = 0.8 


C = 0.9 


P2 




success 


search 


success 


search 


success 


search 


success 


search 


0.0 


0.01 


0.81 


18.21 


0.81 


18.25 


0.81 


18.21 


0.81 


18.17 


0.3 


0.01 


0.87 


13.72 


0.78 


9.13 


0.75 


6.64 


0.73 


5.70 


0.4 


0.01 


0.89 


8.86 


0.85 


6.47 


0.82 


5.46 


0.80 


5.12 


0.5 


0.01 


0.93 


6.88 


0.90 


5.69 


0.89 


5.05 


0.86 


4.88 


0.6 


0.01 


0.97 


5.27 


0.96 


4.91 


0.93 


4.04 


0.92 


3.90 


0.0 


0.001 


0.80 


3.27 


0.80 


3.21 


0.80 


3.17 


0.80 


3.18 


0.3 


0.001 


0.91 


4.15 


0.91 


3.65 


0.87 


3.31 


0.86 


3.06 


0.4 


0.001 


0.94 


4.25 


0.94 


3.69 


0.93 


3.31 


0.89 


3.03 


0.5 


0.001 


0.97 


4.24 


0.97 


3.64 


0.96 


3.26 


0.93 


2.96 


0.6 


0.001 


0.99 


3.96 


0.98 


3.18 


0.98 


3.04 


0.96 


2.79 
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Table 3: Values of Pi, P2, c and d for p c = 0.9 



parameters 


p m = 0.1 


Pw = 0.2 


Pi 


P2 


Pi 


P 2 


c 


d 


Pi 


P2 


c 


d 


0.7 


0.3 


0.66 


0.34 


1.865 


0.202 


0.69 


0.41 


2.139 


0.298 


0.4 


0.66 


0.42 


1.526 


0.234 


0.69 


0.48 


1.674 


0.330 


0.5 


0.66 


0.50 


1.302 


0.268 


0.69 


0.55 


1.378 


0.363 


0.6 


0.66 


0.58 


1.135 


0.302 


0.69 


0.62 


1.166 


0.397 


0.8 


0.3 


0.74 


0.34 


2.114 


0.202 


0.76 


0.41 


2.484 


0.298 


0.4 


0.74 


0.42 


1.730 


0.234 


0.76 


0.48 


1.944 


0.330 


0.5 


0.74 


0.50 


1.476 


0.268 


0.76 


0.55 


1.600 


0.363 


0.6 


0.74 


0.58 


1.287 


0.302 


0.76 


0.62 


1.354 


0.397 


0.9 


0.3 


0.82 


0.34 


0.241 


0.202 


0.83 


0.41 


2.907 


0.298 


0.4 


0.82 


0.42 


1.981 


0.234 


0.83 


0.48 


2.275 


0.330 


0.5 


0.82 


0.50 


1.690 


0.268 


0.83 


0.55 


1.873 


0.363 


0.6 


0.82 


0.58 


1.474 


0.302 


0.83 


0.62 


1.584 


0.397 



7 Discussion 

One prime object of this paper is to show the effect of the neighboring nodes in detection of an event. In this section, 
we discuss the role of the neighboring nodes, some other related issues and make remarks. 

7.1 Role of the neighboring nodes 

Since lnTTjv = & + b{cz^ + tjv — dk(N)), where a, b, c and d are as defined in Section 14.11 c denotes the weight of 
the central node compare to the neighboring nodes in the corresponding likelihood. Note that, since Pi > P2, we 
have c > 1 and, if c is close to 1, then the six neighboring nodes are as important as the event node. So, as the 
value of c increases, the importance of the neighboring nodes decreases. Also, d gives some idea about the role of the 
number of adjacent nodes, i.e., k(N). Recall that Pi and Pi are the probabilities of responding (i.e., reporting the 
node TV as the event hexagon) by the sensors S*jv and Sn>, respectively, when TV is the event hexagon and TV' is a 
neighboring node of TV. So, we numerically calculate the quantities Pi, P2, c and d for some values of the parameters 
(see Table gj). 

From the theoretical results in Section 14.11 we see that, Pi and c increase as p% increases, while P2 and d do 
not depend on p\. On the other hand, while P2 increases with P2, c and d decreases and Pi is independent of 
P2- Therefore, the importance of the neighboring nodes decreases with p\ and increases with p2, as expected and 
observed in Table [3] 

7.2 Estimation of the parameters 

In practice, the parameters p\ 1 p2 1 p w and p c may be unknown. We can, however estimate the parameters by some 
experimentation. 

Note that, under A4o, Z^ follows Bei(p w ) for all TV 6 P. Hence, p w is the expected value of Zjy given Mq- So 
we perform the experiment by keeping the ROI normal. The proportion of Z/v's having value 1 gives an estimate of 
p w . Repeat this experiment several times so that the average of the proportions over the repeated experiments can 
be taken as an estimate of p w . 

Note that, p\ is the expected value of Yjy under Mn- So, we perform the experiment by keeping an event in 
some node TV of the ROI. The proportion of Yjv's having value 1 gives an estimate of p\. Repeat this experiment 
for several times so that the average of the proportions over the repeated experiments can be taken as an estimate 
of p\. Similar experiments will give estimates of P2 and p c as well. 

7.3 Incorporation of heterogeneity and uncertainty in parameters 

Let = (pi,p2,Pc,Pw) denote the set of parameters, which has been assumed to be the same for all the nodes. While, 
in practice there is no reason why the parameters should be same for all the nodes, it is also not clear how these 
would be different across TV. This unexplained heterogeneity can be incorporated by assuming the #'s, for different 
TV, to be independent realizations from a common distribution. 
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Let On = {pin,P2N,PcN,Pwn) denote the set of parameters for node N. We assume that 9n,N £ i?, are i.i.d. 
from some distribution, say, g(9). Also assume that, given On, N £ R, Zn's are independent. Note that g{8) denotes 
the joint distribution of the four parameters. For simplicity, we may assume them to be independent so that g(9) 
can be written as g{0) = gi{p\)g2{p2)gc{Pc)gw{Pw) ■ In this situation, the likelihood for the model .Mo is 



where the integration is over the range of p W N- Similarly, the likelihood for the model A4n can be written as 



where the integral is over the four-dimensional space given by the range of On, and L N {On) is the contribution of 
the iV'th node to the likelihood Ln, given the value 9n, as described in Section |4~T1 

Similar technique can also be used to incorporate parameter uncertainty. Even though the parameters can be 
assumed to be same for all the nodes, there may be reasonable uncertainty about the constancy of the parameter 
values. As in the Bayesian paradigm, the set of parameters may be assumed to be a realization from a distribution, 
say, g{9). Then, the likelihoods for the model Mq and Mn are 



The choice of g{9) may be a difficult one. However, sometimes there may be specific information available regarding 
the distribution of 9, which can be incorporated in the model. 

7.4 When more sensors can detect the event square 

We may consider the situation when sensing radii are larger and more sensors can detect the event hexagon but with 
different probabilities. With respect to a particular node, classify the remaining nodes with respect to the probability 
of detecting the event at that node, which may as well depend on the distance from the particular node. Suppose 
that the sensors in the i-th class detect the event hexagon with probability pi,i = 1,2,3,.... The theoretical analysis 
is similar to that of Section [31 but having more probability terms. 

7.5 Concluding Remarks 

In this paper, we consider the problem of fault detection in wireless sensor network (WSN). We discuss how to address 
both the noise-related measurement errors {pi and P2) and sensor fault (p c and p w ) simultaneously in fault detection, 
where the ROI is partitioned into regular hexagons with the event occurring at only one hexagon. We propose 
fault detection schemes that explicitly introduce the error probabilities into the optimal event detection process. We 
develop the schemes under the consideration of model selection technique, multiple model selection technique and 
Bayesian model averaging method. The different error probabilities are calculated by means of simulation. Note that 
the same analysis can be carried out when ROI is partitioned into squares and sensors are placed at the centers. 

Nandi et al. |16) consider similar problem in wireless sensor network (WSN) , in which the event can take place at 
the center of one particular square (or hexagon) of the grid covering the ROI. In our paper, we allow the event square 
to be any one in the grid. Our approach can also be used for the problem of |16| with only two models to be considered 
for selection. In Q15], the authors develop the scheme under the consideration of Neyman- Pearson hypothesis test, 
where the null and alternative hypotheses correspond to the two models. In model selection approach, we select the 
model with higher likelihood. In classical Neyman-Pearson hypothesis test, a model is selected if it's likelihood is 
greater than some constant times the likelihood of the other. This constant is fixed before the test depending on the 
size of the test. In model selection approach, the constant is 1, leaving no choice for the size of the test. On the other 
hand, we cannot apply the classical Neyman-Pearson test with more than two models to be considered for selection. 

The principle of hypothesis testing places a large confidence in the null hypothesis and does not reject it unless 
there is strong evidence against it. This safeguard of null hypothesis cannot be ensured in the model selection 
approach of Section 14.11 However, the multiple model selection approach of Section 14.21 provides some safeguard in 
this regard. 



u NeR / p 7 w N N ^-PwN) (1 ~ ZN) g w {p W N)dp: 



>wN, 
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This principle of model selection can be extended to the situation when there are more than one event hexagon 
and the objective is to detect the event hexagons. We may also assume that the sensors can detect different types 
of events. That is, response of sensors may not be only binary; sensors can measure distance, direction, speed, 
humidity, wind speed, soil makeup, temperature, etc., and send the measurement of continuous type variables to the 
base station. One needs a different formulation of the problem in such case which will be taken up in future. 
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